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Abstract. We present several applications of the pathwise Burkholder-Davis- 
Gundy (BDG) inequalities. Most importantly we prove them for cadlag semi¬ 
martingales and a general function <h, and use this to derive BDG inequalities 
(non-pathwise ones) for the Bessel process of order a > 1 and for martingales 
stopped at r, with r in a well studied class of random times. 
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1. Introduction 

In recent years a new method of proving martingale inequalities through pathwise 
counterparts has emerged. This approach, which historically arose from considera¬ 
tions in robust mathematical finance, has in particular been applied to derive the 
pathwise Burkholder-Davis-Gundy (BDG) inequalities: see [16], where in section 2 
one can also find more information on the history of the subject. 

The first goal of this paper is to generalize the pathwise BDG inequalities of [16] 
from discrete to continuous time; specifically we to show that, if A is a cadlag semi¬ 
martingale and $ a very general function, one can explicitly construct integrands 
K, K such that for some constant (7$ the following pathwise BDG inequalities hold 

(1) + (A . A)t, $(xn<C'$$(v^J + (A.A)t; 

this turns out to be easy if $(t) = t but hard in general, even in the case where 
<I>(t) = fP for some p > 1. 

Trivially the pathwise martingale inequalities imply their classical equivalent; 
the second goal of this paper is to show that the pathwise inequalities are strictly 
more powerful than the classical ones, in useful ways. Indeed, we present several 
applications of ([T]), in which we are able to extend the classical BDG inequalities 
beyond their traditional domain of validity. Although we concentrate our attention 
exclusively on the BDG inequalities, it is clear that also for other martingale in¬ 
equalities the pathwise version is going to be analogously ‘better’ than the classical 
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one; in this regard, it is interesting to keep in mind that every martingale inequality 
in finite discrete time admits a pathwise equivalent: see mil]. Let us now review 
our applications of (ED one by one. 

If i? is a n-dimensional Brownian motion started at Bq, X := ||i?||Rn anc0 
= fP with p > 0, the BDG inequalities applied to B imply that for some c, C 

(2) cE$(\/[X]t-) < Eil>(X*) < C'E$(\/[X]t-) for all stopping times r. 


In other words, the BDG inequalities hold for such a process X, even if X is not 
a local martingale; X is called an n-dimensional Bessel process. More generally 
(but without making a connection with Brownian motion) one can define the a- 
dimensional Bessel process X for all a G M. This is a positive Feller process with 
continuous paths, and it is a semimartingale if a ^ (0,1), so it is natural to ask for 
which values of a ^ (0,1) the BDG inequalities hold. This questions was answered 
by 0 Theorem 4.1], where one can find a proof of the fact that (0) holds if a > 1 
(the details being spelled out just in the case p = 1). We will show how this is 
just a corollary of the pathwise BDG inequalities; so, while the ideas of 0 yield 
‘good’ constantjl and ours do not, our approach has the advantage of simplicity. 
We should say that throughout the paper we make no effort to get good constants; 
the problem of finding the optimal constants is important and still mostlj@ open. 

It turns out that hold not only when r is a stopping time, but also for many 
random times. Indeed, if r is a finite random time such that {K-X)t and {K-X)r are 
in and have zero expectation, trivially ED implies that (0). This can be useful, 
since given any random time r the set of M G := {N is a martingale and G 
L^(P)} for which E[M^] = 0 is ‘large’ (it has co-dimension 1 in H^) and can be 
quite explicitly characterized: see El Section 3]. Moreover, perhaps surprisingly 
there are quite a number of interesting examples of random times r (called pseudo 
stopping times) which are not stopping times and for which EM,. = 0 holds for 
any M G H^; these times have been studied in EQ], where one can find several 
equivalent characterizations and examples. 

If T is a finitj^ stopping time and At := I[T-_oo)(i) then /(r) = f{s)dAs-, it is 
then natural to ask if one can generalize (ED and obtain that 

pCiO pOO pOO 

(3) cE / ^^/[r[,)dA, < E / <^{x:)dAs < CE / ^^/[^s)dA, 

Jo Jo Jo 


^We recall that, while in general the BDG inequalities only hold for p > 1, they hold for any p > 0 
for continuous local martingales. The inequalities also hold for very general functions for the 
cadlag (resp. continuous) case one can take $ as in Theorem [5] (resp. Theorem 0. 

^Meaning constants with the appropriate scaling in a. 

^If ^(f) = tP for some values of p the optimal value of c or C is known, see m- 
"^This is not really needed, as we will see. 
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holds for any local martingale X, increasing adapted A with Aca = 1 and general 
It turns out that this is true and simple to prove, although (perhaps surprisingly) 
this follows not integrating the pathwise BDG inequalities but rather considering 
the classical ones on an enlarged space; this observation is probably not new, al¬ 
though we include it since were not able to locate a reference in the literature. 

One can ask whether the BDG inequalities hold not only for local martingales, 
but also for semimartingales which admit an equivalent local martingale measure; 
the latter processes being of particular importance in mathematical finance, due to 
the Fundamental Theorem of Asset Pricing (see [6]). As proved in [241 [25] . one can 
exactly characterize the equivalent measures under which the so called ‘weighted 
BDG inequalities’ hold. Indeed, given P ^ P let Zt := E[dP/dP so tha1@ 
Zt = exp(Mt — [M]t/2) for a unique local martingale M with Mq = 0. Then, as 
one can read in [121 Theorem 3.17 and 3.18], if the underlying filtration is such that 
every P-martingale is continuous, M G BMO{¥) iff there exist c, C such that 

(4) cE^/l^ < < Ct^/\X]^ 

holds for every local P-martingale X. While we cannot use the pathwise BDG 
inequalities to obtain with a simple proof the above extremely satisfying result in 
complete generality, we can easily prove a weaker statement which does not require 
any knowledge about BMO-martingales. 

Finally, we briefly discuss what happens to the pathwise and the standard BDG 
inequalities in higher dimension (finite and infinite). 

The outline of the rest of the paper is then as follows. In Section [2] we intro¬ 
duce most of the notations and we derive the pathwise Davis inequalities for cadlag 
semimartingales from their discrete time version. In Section [3] we present an alter¬ 
native and direct proof for the case of continuous semimartingales. In Section|4]we 
derive the pathwise BDG inequalities for cadlag semimartingales from Davis’ ones. 
In Section [5] we prove the BDG inequalities for the Bessel processes. In Section [6] 
we show that the BDG inequalities hold for martingales stopped at many random 
times, and in Section [7] we discuss dH). In Section [8] we discuss what happens after 
a change of measure, and finally in Sectionj^we discuss the multidimensional case. 

2. Pathwise Davis inequalities for cadlag semimartingales 

We now easily obtain a version of the pathwise Davis inequalities for cadlag 
semimartingales by passing to the limit their discrete time version; before how¬ 
ever, let us introduce most of the notations used throughout the paper. We will 
work on an underlying filtered probability space (fl,F, (At)t>o,P) whose filtration 


^See [3 Chapter 2, Theorem 8.3] and 1231 Chapter 8, Proposition 1.6]; the fact that M is a local 
martingale follows from dM = {Z—)~^dZ, since Z is a martingale. 
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satisfies the usual condition^- Given cadlag adapted processes S,X,A, and 
assuming that X is a semimartingale, A is of finite variation (on compact sets) 
and the following integrals exist, we will use the following notations. The cag pre¬ 
dictable process S- has value St- := lim„-|-t Su at time t, the jump of S' at t is 
AS* = St — St-, the running maximum S* of S is given by Sf := supjj<( |5.|, [X] 
is the quadratic variation of X, Aaa is the (possibly infinite) limit limt_>oo At (which 
always exists if A is increasing), {H■ X)t is the stochastic integral HudXu, and 
J* H^dAu (resp. HudAy,) is the Lebesgue-Stieltjes integral J^^^^HydAy (resp. 
/[o t) Hud Ay). Given arbitrary processes K, K we will write that K < K ii Kt < Kt 
holds P a.s. and for all < > 0, and we define by convention Kq- := 0 and 0/0 := 0, 
so that in particular Xq- = Xq_ = [A]o- = 0, the integrands H in Theorem [T] and 
in Theorem [5] are well defined and the measure dX* (resp. d^[X].) has 
mass Xq = |Ao| (resp. y^[A]o = |-^o|) at 0. 

Here come the pathwise Davis inequalities for cadlag semimartingales. 


Theorem 1. Let (Ait)jg[Q qq) be a cadlag semimartingale, and set 

(5) Ht := , - . 

+ ( A /_)2 

Then H is cag, predictable, has values in [—1,1], and satisfies 

(6) y^]<3X*-{H-X), X* <6y^]+2{H-X); 


Proof. Applying [161 Theorem 1.2] to Xi := 0 < i < N with 2^t < N 

gives 

( 7 ) Q^iX)t := < 3M-{X)t - {H- ■ X)t , 

y ZGN 

where M'^{X)t := 1^/2^ I and 

(8) g; := i+iTs) ^ =. 

Passing to a subsequence (without relabeling) we get that Q^{X)s —>■ [Ajg P a.s. 
for all s > 0; since X is cadlag, M"‘{X)b —>■ A/ and sc0 i?” —>■ i/s P a.s. for all 
s > 0. Since |i?”| < 1, using the stochastic dominated convergence theorem we 
can take limits in 0 and obtain that P a.s. yJ[X]t < 3A/ — {H ■ X) t; since t was 
arbitrary and all the processes involved are cadlag the inequality also holds P a.s. 
for all f > 0. The proof of the second inequality is identical. □ 


^Meaning it is right continuous and Tq contains all the negligible sets of Too ■ 
Af [JCJs- -b = 0 then = Hs = 0. 
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The traditional Davis inequalities OT are a simple corollary of the pathwise 
ones. 

Corollary 2. Under the assumptions of Theorem{l\ 

(9) {H ■ X)* < 3{X* + ^/[X]) and [H ■ X] < [X]. 

If X is a local martingale then so is {H ■ X), and 

(10) Evl^ < 3EX^, EX^ < 6E^/f^; 

if moreover Ey^[X]oo < oo then [H ■ X) is a martingale and {H ■ X)'^ G L^(P). 

Proof. It trivially follows from dH]) that —3-\/[X] < {H ■ X) < 3X*; since \H\ < 1, 
[H ■ X] = ■ [X] < [X], (l9|) hold. Assume now that X is a local-martingale, 

in which case also H ■ X is a. local martingales (because H is cag). Let Tn be 
sequence of stopping times which localizes H ■ X\ applying ([6]) to X"^" (instead 
of X), taking expectations and then taking limits for t,n ^ oo we get (fTU)l by 
monotone convergence. If E-\/[X]oo is finite gives that E(iL • X)^ < oo, so 
the dominated convergence theorem ensures that the local martingale H ■ X is a 
martingale. □ 

3. Davis inequality for continuous local martingales 

We give here an alternative statement and derivation of pathwise Davis’ inequali¬ 
ties for continuous semimartingales; the following treatment builds on m Theorem 
5.1], where the easier of the two inequalities was proved for continuous local mar¬ 
tingales starting at zero. The proof in this section has the advantage of being a 
relatively straightforward application of Ito’s formula. 

Theorem 3. If X is a continuous semimartingale then P a.s. for all t > 0 

/ 0 V \ 

^ (vmuu ■ 

Notice that the functional form of the integrand in Theorem[3]is slightly differen1@ 
from the one obtained in Theorem [TJ The next lemma is just a slight modihcation 
of the arguments after equation (4.3) in [16] . 

Lemma 4. Let f,g : K’*' —>■ R’*' be continuous increasing and such that f(0) > 0 
and g{0) > 0. Then on K’*' 

( 11 ) + 

^We conjecture that |16l Theorem 1], from which Theorem^follows, also holds with the integrand 
hn = Xn/{y/[x]n V fc* ) and potentially different constants, although proving this would require 
much longer computations. 
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Proof. Since = --^dx, by a change of variables / d{f V g) = f g^ dj^ 
and integrating the latter by parts we obtain that the middle term in (llip equals 


( 12 ) 


d{g^ - P) 
fyg 


As easily shown with the argument^ after equation (4.3) in [TB], (1121) is always 
smaller than Sg — 2/. Applying this inequality with the role of / and g reversed 
and then multiplying by —1 we see that (fT^ is bigger than —(3/ — 2g). □ 


Proof of Theorem\^ For e > 0 define := ([x \ +%yx- '■= (H^ ■X)t. When 

£ > 0 applying Ito’s formula to and 1/(([A] + e) V X*) we find 




-A2(i(([A]+e)VA*) 


([A]+£)VA* (([A]+e)VA*)2 

In other words for £ > 0 the integral /f eaualJ^ 


2A dX + d[X] 
([A] + £)V A* ■ 



s=t rt X^d(^{[X]+e)\/ X*'^ 

/■* (7([A]-|-£) 

([A],+£)VA/ 

S= 0 ^ Jo (([A]+£)VA*)2 

Jo ([A]+£)VA* 


Since Ag = Ag^, the quantity in (fT^ trivially gets bigger if we replace each occur¬ 
rence of A by A*, so applying Lemma|3]to f{t) = [A]t -|- e,g{t) = X^ gives 


r < 3A* -2([A] -P£). 


To pass to the limit as £ —>■ 0 notice that if [Aj^VAj* = 0 then At = 0, so = 0 and 
= 0 (since by our definition 0/0 = 0); if instead [A]t V A/ > 0 then —>• 

is trivially true. In summary, the stochastic dominated convergence theorem gives 
that 7® —>■ 7° uniformly on compacts in probability as £ —^ 0, so there exists some 
£„ —?> 0 for which 7/" —>■ 7° a.s. for all t, proving that 7° < 3A* — 2[A]. 

To prove the opposite inequality, replace A with A* in m in both occurrences, 
and call the resulting quantity; then Lemma S] applied to / = [A] + e,g = X* 
yields > 2A* — 3([A]-|-£). The thesis 7° > A* —4[A] then follows taking £„ —>■ 0 
as above if we can show that P — > —A* — [A]. To prove the latter, let us 

bound separately the two terms C^, whose sum gives P — . First 

Xf - Xf 0-X*^ _ 

■ ([a]* + £)va; - ([a], + £)va/- *■ 

For the second term, notice that if /, g are continuous increasing then 


(14) d(/Vg) = l{/>g}d/-7l{/<g}(ig; 


^Unlike [16| . our / and g are strictly positive; however this has only the effect of slightly simplifying 
the calculations. 

^^We also use the trivial fact that d[X] = d{[X] + e). 
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applying this to / = [X] + e and g = X*, and using that Xf = (X^)^ holdJ^ for 
dXf a.e. t, we get that 

j . r*x^-x*~ 


:= 


/ 


/ \ r 

— d(i[X] + e) V X*) = liixHe>x^}d[X], 


/o ii[X]+e)VX*r ' J a^]+e)^ 

which is bounded below by — [^]t since the integrand on the right hand side is 
bounded below by —1. □ 


4. PATHWISE BDG INEQUALITIES FOR CADLAG SEMIMARTINGALES 

In this section we modify ideas of A. Garsia to show that the general (pathwise) 
BDG inequalities are a consequence of Davis’ ones; this approach was already taken 
in m in the (technically much simpler) discrete-time case when $(t) = for some 
p > 1. For expositions of Garsia’s ideas we refer to [HI Pag 101 to 106] or [5]; a 
slightly modified versiorj^ is given in [T] Chapter 7, Lemma 91]. 

Notice that we do not derive the general pathwise BDG inequalities passing to the 
limit the discrete time statement |161 Theorem 6.3] as done for Davis inequalities. 
The problem with this approach is that for p > 1 it is not easy to show that 
the discretized integrands corresponding to ([5]) should converge to their 

continuous time equivalent, since iL" can be written aJ^ /p K^dY^, where also 
the integrator P" depends on n (and similarly for G"). 

We will henceforth consider a function (j): 1R+ —>■ 1R+ which is cadlag, increasing, 
unbounded and such that (/)(0) = 0; as usual we define (/'(O—) := 0, so that in 
particular dtj) has no atom at zero. The integral 4>(t) := (j){s)ds is a convex 
increasing function such that —>■ oo as t —>■ oo. We will also assume that 

4)(t) is ‘tame’, i.e. that there exists some constant G$ such that $(2t) < G<i)4)(t) 
for all t; equivalently, there exists some constant such that (f>{2t) < c^(j){t) for all 
t. Such functions $ are well studied in connections to Orlicz spaces, and are often 
called ‘Young functions’, although they are also referred to by various other names. 
In particular the interested reader should consult m. where $ would be called an 
‘N-function satisfying the A 2 -condition’. One can then show that the ‘exponen10’ 

(15) p:=sup—— , 

u>0 $(u) 

is in (l,oo). Moreover if '(/;(t) := inf{s : (j){s) > t} is the cad (so tp-it) := 
is the cag) inverse of </), and 'li(t) := f*'ip{s)ds is the convex conjugate of $, the 


^^Indeed O := {t > 0 : X* > |^t|} is open in K, so it can be written as a countable union of open 
intervals; on each of these X* is constant, so dX* is supported by R+ \ O = {f > 0 : X^ = |Xt|}. 
^^The quantity E[$(|X|)] is replaced by the seminorm ||Ar||$ := inf{A > 0 : E[$(|X|/A)] < 1}. 
l^Indeed Y" = <I>(VQ"A)o) 

$(t) = then p in II15II equals c. 







PIETRO SIORPAES 


following inequalities hold: 

(16) uv < $(u) + 'I'(ii), $(aM) < if a > 1, il'(au) < a'^{u) if a < 1, 

'I'(s) < (p — 1)^’(V’('S“)) and ip-{(j){s)) < s and so di((/)(s)) < (p — l)‘h(s) 

Theorem 5. Assume that (^t)te[o,oo) ® cadlag semimartingale, $ is as above 
and p is given by dig. Let C (resp. D) be the cad inverse of \/[X] (resp. X*) and 
define Cp := p(QpY, 

Ht:= f _ Gt := [ pF^^^'>df>{s), 

where s<t is defined as 

piA — Xs- _ 

^[X]t- - [X],_ + sup,<„<,(X„ - ■ 

Then H, G are cag predictable and 

(17) ^^/[X]) < Cp<i>{X*) - {H ■ X), $(X*) < Cp$(v1^) + 2(G-X). 

Moreover if (/) is continuous then H and G are lad and 

(18) Ht= [ Gt= f pFl^U<t>{Xl). 

J[o,t) Jlo,t) 

We will derive in (l28)l integral expressions for Ht+ and Gt+; these show that if X 
is continuous then also FI and G are continuous. Notice that the function $(t) = t 
does not satisfy the assumptions made on $ in Theorem[5] despite of this, Theorem 
[T] affords the equivalent of (IT71) . 

Of course, given a sequence of real numbers (xn)n>o and the probability space 
{w} made of one point, applying (ITTl) (resp. ([6])) to Xt(uj) := Xnl[n,n+i)(l) we 
obtain pathwise BDG inequalities for functions of a real variable, which if <i>(t) = t^ 
reduce to m Theorem 6.3] for p > 1 (resp. to [TBl Theorem 1.2] for p = 1). 

Moreover, if (j) is continuous the integrands FI and G in Theorem [5] are caglad, 
so ini) are really path-by-path inequalities. Indeed, for P a.e. ui one can compute 
{H ■ X).{u}) and [X].(a;) := X'^{uj) — (2X_ • X)(a;) only making use of Hfuj) and 
X.(uj) by taking limits of Riemann sums computed along appropriate sequences of 
hitting times fseJ^ [21 Theorem 7.14] and [H]). Unfortunately this remark does 
not apply to Davis inequalities ([6]) , since F[ in Theorem |T] is not necessarily lad, 
not even if X is continuously differentiabli^. 

The traditional BDG inequalities are a simple corollary of the pathwise ones. 

^®One can apply the cited theorem since := is adapted (since such is H, and the 

filtration is right continuous) and = Ht- 

If Xt := Usin(l/t) for t > 0 and Xq := 0, X is and thus has finite variation, so it is a 
semimartingale with [X] = 0, and Ht = Xt—jX^_ keeps oscillating between 1 and —1 as t J, 0. 
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Corollary 6. Under the assumptions of Theorem\^for all y S [i, 1] 

and if X is a local-martingale then so are {H ■ X) and {G ■ X), and 

(20) E<l>(v1^ < CpE$(x;,), Eil>(X^) < CpE$(\/f^). 

In particular if X is a local-martingale and E<i)(y^[X]oo) < oo then {H ■ X)t and 
(G • X)t are martingales and {H ■ X)%^, (G • X)l^ S L^(P). 

The rest of this section is devoted to the proof of Theorem [S] and Corollary [51 

Proof of Corollary[^ Since for increasing positive A, B, with cad B, we have AdB < 
AtBt, and since < 1 implies that H'^ < p^^(v^[X])^, using (ITSl) we get 

[H-X]=H^- [X] < ■ [X] < p^f>{^])^[X] < p2(p$(y^))2. 

Analogously for G we can write, for any y > 0 

< ^pmx*Y ■ [X] < pcf{X*)^] = {pyVm) ; 

when 2 / G [1/p, 1] we can apply the inequalities (fT51) and get 

(pyVm) (^) < ■iipyXU) + ® (^) < (pymvm) + Atipi, 

now bound the last term above using that ^'((/(s)) < {p — l)$(s); putting the 
inequalities together concludes the proof of (fTOl) . 

If X is a local-martingale, working as in Corollary [5] gives the thesis: the only 
difference here is that E(7J • X)^ < oo follows from E$(a/[X]oo) < oo since (HID 
gives that Ky/[H ■ X]oo < oo and we can then apply (fTOll to H ■ X (instead of 
X). □ 

Proof of Theoreml^ Step 1: H,G are cag predictable. Since {Cs < t} = {s < 
\/[X](_}, is defined on [0, \/[X]^_) and setting := 0 for s > t we ge10 

I'OO poo 

Ht := / Gt := / pF^°^U(f{s). 

Jo Jo 

Denote with Ft ® B the product sigma algebra of Ft with the Borel sets B of 
R+; we will now prove that (f/‘"'’^) sg[o_oo) is F* 0 measurable, so that Ht is Ft 
measurable, i.e. H is adapted. Since \/[^ is adapted, Gs is a stopping time and so 
Gs A t is Ft measurable, and so since G. is cad the map Z{oj, s) := (u, Gs(cj) A t) is 
Ft®B/Ft®B measurable. Analogously (but using left continuitjo of (F/*^)sg[o.t) 
and the fact that f/®^ = 0 for s > t) the map f/’^ is Ft 0 F measurable, and 

write f^ for which is the same as U since by definition <t>(oo) = limt_>oo 'l’(i). 

^®We warn the reader that for s f f the limit of may not exist (so we specified s £ [0,f)). 
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iC I I-I {G I 

thus so is the composition ’ o Z. Since F^ G [—1,1], the dominated 

convergence theorem implies that H is cag (so predictable): indeed if ^ and 
Mf” := supc,<„<t \X^-Xc,- \ > 0 then trivially Mf'’ andFj^f'’^ Ff‘\ 

whereas if = 0 then trivially = 0 and so 0 = F^^”^ —>■ F^^‘^ = 0. The 
proof that G is adapted and cag is analogous. 

Step 2: X satisfies < Cp^{X*) — (H ■ X). Since Cs is a stopping 

time := Xc^+t — Xc^- is a semimartingale (w.r.t. the time changed filtration 
:= Fc^+t) and satisfies 

( 21 ) y[x]^t-./[x]^ < vmc.+t - 

Define f{x, q, s) := xj\Jq + and apply ([B]) to (instead of X) to get 

( 22 ) where :=/(ri"\ ri"^*). 

Writing the integrals as limits (in probability, uniformly on compacts) of Riemann 
sums we get 

(ff(-) . F(-)), = , 

so since using (1^ . (1221) we get 

(23) - /^;+- F^^^^dX^ . 

Since \J[X]^ _ < s, {y^[X]t > s} = {Cg- < t} D {Cg < t} and [X] is constant on 
[Cs-, Cg) (when this interval is non-empty) we get 

{\/[X]t - s)+ < {\/[X]t - \/[-’f]c.-)l{yp^>^j = iVWft - \/[-’^]c.-)l{c.<i} 

which, combined with (|23ll and with X^l^c^<t} < 

(24) (4^ - s)+ < f Fjf^Ux^. 

J Cs 

Integrating (1^ over s G [0,oo) with respect to d(j){s) and using the identities 
= /o°°(^ “ s)+d(/)(s) and l{c.<t} Jc, F^^^'^dXu = Fi^‘‘h[c.<u}dXu gives 

(25) <^{VWt) < eX.XvW) - 
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Since {Cs < m} = {s < \/[X]^_}, the stochastic Fubini theorem [HI Chapter 4, 
TheorenO 65] gives 

poo pt pt p 

(26) / / F^^^h{c.<u}dX^d(j^{s) = I ^ dcj,{s)dX^. 

Jo Jo Jo 

Now apply the inequalities (IT51) to write 

6x;^i./[xYt) < HGpx:) + < (GprHx:) + vi/(<^(yiT];))/p 


and bound the last term using that 4t((^(s)) < {p — l)$(s); combine the resulting 
inequality with (1251) and (l26ll to get 

$(v1T];)(i - < (6p)J>$(x;) - ^dx^, 

i.e. the first inequality (113- 

Step 3: X satisfies $(X*) < Cp^{^J[X]) + 2(G • X). Proceeding analogously 
using X* and D (instead of \/[X] and C) yields a which satisfies 

x*D,+t - Xh,_ < sup I = r/*)* 

u£[Ds ,Ds-\-t] 


and since \/[F('*1]. < yJ[X]D^+- we obtain 


D^+. - ^ 6\/I^UT + 2 /^;+' Fi°‘^dXu ; 


the proof continues exactly as before, yielding the second inequality C3- 

Step 4: Alternative expression for H,G. If (p is continuous and A = [A], 
the cad inverse g oi cpo A equals C o ip and [p o A){u—) = (j){Au- — ), so 


(27) 


f.(0oA)(«-) 




'(0oA)(O-) 


HAu--} 


4i(Ao--) 




By change of variable the first integral in (IHl) equals Fi'*L^(As), and the 
second one d(j){s)] thus /[q t) A]s) equals Fit. Proceed 

analogously for A = X* and G. 

Step 5: F[ and G are lad. We will use the expression dT51) . writing however the 
integrals over [0,oo) and extending F^^'^ to be zero for s>t. Define the quantities 
Mp := sup^<„<( |A„ — Xs-\ for s < t (so that Mp = |At — At_| for s = t) and 


pis) _ Xt Xs- _ 

^[A],-[A],_ + (M/)2 


for s <t, := 0 for s > t. 


^^As observed after the statement of the theorem, by passing from dfi to fdfi for some / € 
one can prove the theorem for sigma-hnite fi. Saying it differently, one can consider the finite 

(C I 

measure d^{s) = exp(—0(5))d0(s) and apply Theorem 65 to := exp(0(s))F.i] ^'^l[o,Ai4_)('5)- 
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Iftnit and s > t then by definition = 0 = for n big enough (such that 
tn < s). If s < t then trivially —>■ M/, so if > 0 we get —>■ 

however if = 0 it can happer@ that > 0 for all n and that does not 
converge to F^ ’ = 0. We shall now show that this can only happen for s in a set 
of d/i := d{(l>{^/[xY.) + 4>{X*)) measure zero, so —>■ f/*^ for a.e. s and the 
dominated convergence theorem implies that PI and G lad and 

( 28 ) Ht+ = [ pF,^^^dct>{./[xYs ), Gt+= f pF,^^Uct>{X:). 

J[o,t] J[o,t] 

If the set Z := {s : = 0} contains some element s then necessarily it contains 

the whole interval [s, t]; it follows that, for some sq G [0,t], Z equals either [so,^] 
or (so,i]- Since X is cad, if Z 5 Sn i s then s € Z, so Z = [so,t]. Since Xu takes 
the constant value Xsq- for all u S [so,t], [X]ii (resp. X*) takes the constant value 
[X]sQ_ (resp. X*^_) for all u G [so,t], so dp gives measure 0 to [so,t]- 

□ 


5. The Bessel process 

We will now prove BDG inequalities for the Bessel process as a corollary of the 
pathwise Davis inequalities. While Davis inequalities follow easily, to recover the 
general BDG inequalities we do not use Theorem [SJ but rather apply a strength¬ 
ened version of Davis inequality, obtained by a rather delicate modification of the 
arguments in [TJ Ghapter 7, Lemma 91]. 

Theorem 7. Let X he the Bessel process of dimension a G [1, c») started at Xq > 0 
and $(f) = tP for p > 0, then there exist constants c,G such that 

(29) cE$(\/[X]t-) < E$(X*) < C'E$(-\/[X]t-) for all stopping times t . 

More generally, (1291) holds if^ : IR.+ —>■ IR.+ is cadlag, increasing, such that ^{x) = 0 
iff X = 0 and for which sup^^g <I>(,0t)/$(t) < oo for some (and thus all) /3 > 1. 

The only facts about X which we will need in the following proof is that X. > 0 
P 0 a.e. and X is a weak solutior0 of 

(30) dX = -b dW, 

2Jv 

where IT is a standard Brownian motion w.r.t some underlying filtration In 

fact X is positive and it never hits 0 (after time zero) if a > 2, whereas for a G (0,2) 
a.s. X hits zero but the set {s : Xg = 0} has Lebesgue measure zero (for all these 
statements see [23l Page 442]). That X solves (|^ is stated in [23l Chapter 11, 


^*^For example take s = 0 and see footnote 1161 

^^The explosion time of is oo, i.e. the solution to is defined for all t € [0, oo). 
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Exercise 1.26], and if a > 2 the proof is simple and can be found just before [23l 
Chapter 11, Proposition l.lOj. 

Notice in particular that X has continuous paths and [X]t = Xq +1. 


Lemma 8 . Let X he a semimartingale such that X. + C > 0 P 0 a.e. and 

(31) + Xo> 0 , 

where W is a standard Brownian motion w.r.t some underlying filtration {Xt)t, 
C > Q is a Xo measurable random variable and 7 > 0. Then 


(32) 


E(x;|J-o)<(6 + 27)E(v^JJ-o) 


and if C = 0 then 

( 33 ) E{./[Xl\Xo)<m{x:\Xo). 

Proof. Taking Ht '.= Xt/yJ[X]t + {XfY, since \H\ < 1 we get that E Hfds < 
t < 00 , so H ■ W is a martingale and dSU) gives 

(34) E((H . X)„,|^o) = E (f‘ . 

In particular, since [X]t = Xq + t and X./{X. + C) < 1 P 0 a.e., deleting the 

positive X* term from (IMl) we can bound E((H ■ X)r/\t\^o) from above with 


E 



7 

2 V -^0 + ® 


ds 



= 7(E(^X2 + rAt|J-o) - jXol) < 7 ]E(\/I^W|.Fo) . 


Now evaluate the second inequality ([61) at time t A t, take E(... jj^g): apply the 
bound we proved for E((R ■ X)T-At|J'o) and take limiti^ as t —>■ 00 to get (15^ . 
If C = 0 then X. = X — C > 0 P 0 a.e., so trivially from (IMl) we get the 

bound E((H ■ X)r/\t\^o) > 0) so (l33l) follows evaluating ([ 6 ]) at time rAt and taking 
limt_>ooE(... jXg). □ 


Proof of Theorem^ The case a G N\{0} follows from the analogous statement for 
Brownian motion (for which we refer to |151 Page 37] for generao ‘h and dimension 
one; for higher dimension see our Section 19]): let us show this in detail. If n S 
N \ {0}, B is a n-dimensional Brownian motion started at Bq and X := USllRn 
then [X]t = llBollIn +t and [B]t = lli?oll|n + nt, so [B]/n < [X] < [B]. Thus, 
since Xf = supg<( jjBgjjRn, the BDG inequalities applied to B imply those for X. 

From now on we can then assume a > 1. Since X solves dSOl), we can apply 
Lemma [5] to X with C = 0 and 7 = 0 —!, and taking expectations gives the thesis 
for <I>(t) = t. So, let us consider the case of general $ and a > 1. If cr < ct are finite 


^^Use the monotone convergence theorem. 

^^The statement in the special case ‘t’jt) = can be found in most books on stochastic calculus 
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stopping times then Wt := Wa+t — Wa (resp. a — a) is a := Tcr+t Brownian 
motion (resp. stopping time), and Xt := trivially satisfies dX = ^^dt + dW. 
We can then apply (l33|) with {X, W, F, r, C, 7 ) := {X, W, F,d — a,0,a — 1) and 
combine this with the bounds X^ < X*_^_^ and 

- ^[X]A{.>0} < ^[X]a+t - [X]. 1 {.> 0 } = < VS 

to obtain 


(35) 


E(vW- vWl{a>o}l^<.) < 3E(l;_J.Fo) < 3E(x;|j-<,). 


Define the localizing sequence cr„ := inf{t > 0 : X^ + [X]t > n} A n and, given 
stopping times t, f, 9 with t < t, define a := an /\ 9 /\ t, a := an A 9 A t. Since 
a < a < 00 and we can apply (1351) to get 

< 3E(X;|J-.). 


Since and Xi are J'cr„A 6 i measurable and E(y|J^cr) = 

E(E(y|J^CT„Ae)l-^T) for any positive r.v. Y, we obtain that 

E(y^ix^. - ,/ix^llfn>0}i^r) < 3E(X||J-.) . 


Integrating this over {t > r} S T-r gives, since 

E(^/Jx^.l{,>o} - ^\^^]l{r>e}) < 3E((X""^^);i|,>.}). 

It follows from [m Lemma l.I] that for some constant c 

E$(x/m;i^) = Eci>(Y^ix^^) < cE$(x"'‘^^):, = cm{x:^^,) , 

and so taking n —>• 00 gives E<i>(Y^[X]e) < cE<i)(Xg). 

To prove the opposite inequality unfortunately we cannot work analogously with 


X, because we would need to use that y [X]t < y [Xj^r+t, which is not true (indeed 
— [X]n + X^). We consider insteaco It := X^+t —-^ctI{(t>o}) so that 
a — 1 


(36) 


dY = 


-dt + dW. 


2 (r + x,i{,>o}) 

Applying ([31]) with (A, IT, J", r, C, 7 ) := (F, W, A, d - tr, A^l{^>o}, a - I), and 
combining this with the bounds \/[Y]t < y^[A]cr+t and 


a:+, - a:I{„>0} < y ; 


^^Analogously we cannot consider Y instead of X in the previous proof, since the corresponding 
inequality in Lemma [8| only holds for C = 0. 
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gives, for m := (6 + 2(a — 1 )), 

(37) E(X; - < mE{^[Y]a-a\To) < mE{y^\T ,), 

which is the equivalent of (1551) (with X* and -\/[X] reversed). The proof now 
continues exactly as above. □ 


6. Random times 

From Theorem [5] and Corollary |6] (resp. Theorem [T] and Corollary [2]) it follows 
that the BDG inequalities ([2]) hold for any pseudo stopping time r and local mar¬ 
tingale if $ is as in Theorem 0 (resp. if $(t) = t); to see this, one first has to 
localize X so as to make H ■ X and G • X in , then take expectations and then 
limits (using the monotone convergence theorem). 

Although the above extension to pseudo stopping times had already been proved 
in [TU Propositior@ 2 ] with change of filtration techniques, it is convenient that it 
follows automatically from our approach; moreover, as mentioned in the introduc¬ 
tion, we are able to obtain yet another setting in which ([ 2 ]) holds, and this seems to 
be new. Indeed, one can go the other way around and, given an arbitrary random 
time r, study the subspace Si{t) of M G for which EMr = 0. The above 
discussion shows that iO H ■ X £ Si{t), where X is a local martingale and H is 
as in Theorem [l] then ([^l hold for $(t) = t; analogously for H,G from Theorem 
O and a correspondingly general <&. As we already said, this can be useful since 
5i(r) is ‘large’ and can be quite explicitly characterized: see [HI Section 3]. This 
works out particularly well when r is an honest time; for example one can show 
that if is the filtration generated by a one dimensional Brownian motion B 

and T := sup{t < a : Bt = 0}, where a is the first time B hits 1, then r is an honest 
time and if (Tt)t denotes the local time at zero of B and £"(a:) is the Laguerre 
polynomial then Mt := E[£„(Lcr)|Xt] is in 5i(t) whenever n 1: 

see [l8l Example 3.7]. 

7. Randomized stopping times 

In this section we prove that for $ as irF^ Theorem [S] (and also for $(t) = t) 
we have ([3]) for any local martingale X and randomized stopping time A; but first, 
we need some more definitions. We say that A is a randomized stopping time if it 
is a cadlag increasing adapted process with Aq = 0 and limi_,.oo At < 1. Breaking 
from our conventions, in this section we allow A to have a jump at infinity: we 
will write Aqo- for limt-too At, and we define A^o ■= 1 (however X^ is defined as 

^^This has a typo: p should be > 1. Only if M is continuous one can take any p > 0. 

^^Because of Corollary [2] it is clear that H ■ X is in if one (and thus both) of the quantities in 
unii are finite; what it not clear is whether K{H • X)r = 0. 

X is continuous then one can even take $ as in Theorem [3 
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usual as limt_>oo , and analogously for [X]oo) and if / : [0, oo] —>■ [0, oo] is Borel 
then f{s)dAs := f{s)dAs. If t is a stopping time (not necessarily finite) 

At := 1 [t- oo)(t) is a randomized stopping time, and /(t) = f{s)dAs. 

Let us now prove ©; as we just said, the integrals are over (0, oo]. If the 
underlying space is (fl, (J^t)t,P), consider the enlargement l) := 11 ig) [0,1] endowed 
with the product probability P := P (g and the usual augmentation of 

the filtration (J) (g B)t (where denotes the Lebesgue measure and B the Borel 
subsets of [0,1]). If is a localizing sequence for X and Y a process on 11, we 
extend them to Cl by setting for all t 

fn{uj,s) := Tn{oj), Yt{uj, s) := Yt{uj) for all w G n,s G [0,1]. 

Let C{oj, s) := inf{t : Atiuj) > s} be the cad inverse of A; then trivially fn are {Xt)t 
stopping times localizing the {Xt)t local martingale X, and C is a {Xt)t stopping 
time since {C < t} = {(w, s) : s < At-^uj)} is in Xt®B. In particular we can apply 
the BDG inequalities (fTOjl and (l20l) to the local martingale X stopped at C, and 
then use the change of time formula 

EFc := / %dF = E /g Ycjs = E Y,dA, 

with Y = <i)(-\/[X]) and Y = <i>(X*) to obtain ([3]). 

8. Change of Measure 

As we mentioned, the proof of the equivalence M G BMO{¥) (|4]) is not 
simple; in this section we show how to easily get the following weakei@ statement. 

Theorem 9. For every s,T > 0 there exist c, C sueh that 

(38) dR^/\xYr < EXjl < CEyi^ 

holds for all stopping times r and X satisfyin^^ dX = aX{dW + fidt) on [0, r] for 
some predictable a, p, such that \p\ and \ap\ are bounded by s and p = 0 on (T, oo). 

Proof. Since X satisfies dX = aX{dW + pdt), X is continuous and /g^^ a'^X'^ds < 
oo a.s. for all t < oo (otherwise f^aXdW is not defined on [0,t]). Thus, if 
Fit ■= Xtj^\X\t + (Xf)'^ G [—1,1], Mt := HaXdW is well defined and a 
local martingale, and so there exist a localizing sequence (t„)„ for it and we get 

pTnAr pT At 

\E{H • = |E / HaXfidt\ <sE |X| dt < sTEX^, 

Jo Jo 

this statement the role of P and P is reversed, and P is chosen so that dW := dW + fj^dt is 
a P-Brownian motion. Given M := the local P-martingale exp(M — [M]/2) is in BMO{P) 

since it has bounded quadratic variation, so this theorem is a special case of the result above. 
^^We are not assuming that this SDE has solution with explosion time strictly bigger than r: this 
will depend on cr. We are saying that, if (^t)t^[o,oo) is a such a solution, then (|38ll holds. 
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and thus applying ([6]), localizing, taking expectations and passing to the limit we 
conclude E ^ y [ X ]^ < (3 + sr)EX*. Analogously since by Holder inequality 


|E 


pTnAr ^OO / i-T 

J HaX^dt\<sE J l[Qj^-^{l[Q^T.]\aX\)dt<sEVTJj a'^X^ dt 


and /J" a'^X^ dt = [X]t, we can conclude that EX* < (6 + 2s'/T) Ea/[X]^ . □ 


The reader may be interested in knowing that there are several conditions equiv¬ 
alent to (HI) which one can impose on Z: the reverse Holder inequality, the Muck- 
enhoupt (Ap) condition and yet another unnamed (Bp) condition (see Theorem 
3.4, Corollary 3.4 and Theoreir0 2.4 in [E])- Moreover, in [3] one can find (under 
additional conditions) an extension of the implication M G BMO{T) (j4|) in the 
case where not every martingale is continuous. 


9. Higher Dimension 

The pathwise BDG inequalities ([B]) and (ITT)) , so far stated and proved in dimen¬ 
sion 1, automatically hold in any finite dimension (with worse constants). Indeed, 
since on R" all norms are equivalent, if 11 • | Ir" denotes the Euclidian norm there exist 
0 < an < /3n such that for every n-dimensional semimartingale X = (X*)i=i_,,,_„ 

n n n 

anY.{Xrt=cyuY.sup\Xl\<X: := sup||X||Rn <^„^(X*):. 
i=l i=l i=l 

Thus the fact that Davis inequalities (H)) hold for X follows summing over i the 
corresponding inequalities for X* (since [X] = -X) := ■ X^)). 

Similarly one obtains (HZl, using also the fact that for all > 0 

I n n n 

-Y.hu) 

i—1 i—1 i—1 

these inequalities hold since $ is increasing, convex and satisfies $(n<) < nP$(t). 

Unfortunately, all this falls short of what one can do with the classic BDG 
inequalities, for which one can not only apply the above reasoning, but also prove 
that automatically they hold for every martingale M with values in a Hilbert space 
H, and with the same constant as for R.^. In fact, one can easily construct (possibly 
on an enlarged probability space) a R^-valued martingale N such that ||Mt||ij = 
||Xt||R 2 and [M]t = [X]t for alH > 0. For the simple proof of this nice yet not so 
well-known result of [10] in the discrete time case see [Ml proposition 5.8.3]. For 
the general (much harder) cadlag case one can consult [TU] ; notice however that one 
does not need this to prove the BDG inequalities for cadlag martingales, as these 
follow from their discrete time version! 


^®This Theorem has a typo, it should be p > 1 not p > 1. 
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